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1 

DESCRIPTION 

Thermostable DNA Polymerase 
Background of the Invention 

w 

This invention relates to thermostable DNA 
polymerases useful for DNA sequencing. 

Innis et al., Proc. Natl. Acad. Sci. USA 85:9436- 
5 9440, 1988 state that a DNA polymerase from Tfieymus 
aquaticus (termed Tag or Taq DNA polymerase) is useful for 
DNA sequencing. 

Lawyer et al., J. Biol. Chem. 264:6427, 1989 describe 
the isolation and cloning of DNA encoding Taq. The DNA 
10 and amino acid sequences described in this publication 
define the Taq gene and Taq DNA polymerase as those terms 
are used in this application. 

Gelfand et al., U.S. Patent 4,889,818, describe the 
isolation and expression of Taq and state that: 
15 It has also been found that the entire coding 

sequence of the Taq polymerase gene is not required to 
recover a biologically active gene product with the 
desired enzymatic activity. Amino-terminal deletions 
wherein approximately one-third of the coding sequence is 
20 absent have resulted in producing a gene product that is 
quite active in polymerase assays. 

Thus, modifications to the primary structure itself 
by deletion, addition, or alteration of the amino acids 
incorporated into the sequence during translation can be 
25 made without destroying the activity of the protein. 

In the particular case of Taq polymerase, evidence 
indicates that considerable deletion at the N-terminus of 
the protein may occur under both recombinant and native 
conditions, and that the activity of the protein is still 
30 retained. It appears that the native proteins isolated 
may be the result of proteolytic degradation, and not 
translation of a truncated gene. The mutein produced from 
the truncated gene of plasmid pFC85 [containing a 2.8kb 
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Hin dIII- Asp 718 restriction fragment; where the Hin dlll 
site is at codons 206 and 207] is, however, fully active 
in assays for DNA polymerase, as is that produced from DNA 
encoding the full-length sequence. Since it is clear that 
5 certain N- terminal shortened forms are active, the gene 
constructs used for expression of the polymerase may also 
include the corresponding shortened forms of the coding 
seq[uence. 

Summary of the Invention 

10 The invention features a vector which includes 

nucleic acid encoding a DNA polymerase having an identical 
amino acid sequence to that of the DNA polymerase of 
Thermus aouaticus . termed Taq DNA polymerase, except that 
it lacks the N-terminal 235 amino acids of wild-type Taq 

15 DNA polymerase (see Lawyer et al. , supra ) ■ This DNA 
polymerase is designated A Taq (Delta Taq) in this 
application. 

Applicant has discovered that the N-terminal 235 
amino acids of Taq polymerase can be removed without loss 

20 of the DNA polymerase activity or thermal staQsility of the 
polymerase. The A Taq polymerase is still stable to 
heating at high temperatures, but has little or no 5*- 
exonuclease activity as determined by DNA sequencing 
experiments. Because of the lack of the associated 5*- 

25 exonuclease of Taq, theATaq polymerase is significantly 
superior to wild-type Taq polymerase for DNA sequencing. 
TheATaq polymerase can be used with little consideration 
being paid to the length of time or the buffer conditions 
in which the extension reactions of the DNA sequencing 

30 reaction are performed. 

In preferred embodiments, the vector is that nucleic 
acid present as plasmid pWB253 deposited as ATCC No. 68431 
or a host cell containing such a vector. 

In a related aspect, the invention features a 

35 purified DNA polymerase having an amino acid sequence 
essentially identical to Taq but lacking the N-terminal 
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235 amino acids, e.g.,ATaq. By "purified" is meant that 
the polymerase is isolated from a majority of host cell 
proteins normally associated with it, preferably the 
polymerase is at least 10% (w/w) of the protein of a 
preparation, even more preferably it is provided as a 
homogeneous preparation, e.g., a homogeneous solution. 

ATaq appears to be less processive than wild-type 
Tag. More units of DNA polymerase are necessary for ATaq 
to complete a PGR amplification reaction. 

The drawing is a reproduction of an autoradiogram 

formed from a sequencing gel. 

The following is intended to demonstrate an example 
of the method and materials suitable for practice of this 
15 invention. It is offered by way of illustration and is 
not limiting to the invention. 

rons-bructinn of an Kvpressiblft Gene foy — Ta( 

in order to construct the A Tag DNA polymerase gene 

having an N-terminal sequence shown as nucleotide sequence 
20 1, and a C-terminal sequence shown as nucleotide sequence 

2, the following procedure was followed. 

The mutated gene was amplified from 0.25 ug of total 

T>.o>nnnfi amiaticus DNA using the polymerase chain reaction 

(PGR, Saiki et al.. Science 239;487, 1988) primed by the 
25 following two synthetic DNA primers: (a) a 27mer (shown as 

nucleotide sequence 3) with homology to the wild-type DNA 

starting at wild-type base pair 705; this primer is 
designed to incorporate a HSfil site into the product 
amplified DNA; (b) , a 33mer (shown as nucleotide 
sequence 4) spanning the stop codon on the other strand 
of the wild-type gene encoding Tag, and incorporating a 
Hin dlll site into the product DNA. 

The buffer for the PGR reaction was 10 n4S Tris HCl 
pH 8.55, 2.5 im MgGlj, 16 mJ4 (NHJjSO,, 150 ug/ml BSA, and 



30 
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200 taM each dNTP^ The cycle pareuneters were 2* 95*, 2' 
65*, 5» 72*. 

In order to minimize the mutations introduced by PGR 
(Saiki et al., supra > , only 10 cycles of PCR were 
5 performed before phenol extraction, ethanol precipitation, 
and digestion with the restriction enzymes Nco l and 
aindlll. 

The product Nco l and Hin dlll fragment was cloned into 
plasmid pWB250 which had been digested with Nco l, ilindlll, 

10 and calf intestine alkaline phosphatase. The backbone of 
this plasmid, previously designated pTAC2 and obtained 
from J. Majors, carries the following elements in 
counter-clockwise direction from the Pvu II site of pBR322 
(an apostrophe » designates that the direction of 

15 expression is clockwise instead of counter cloclcwise) : a 
partial lac Z * sequence, lac i ' , lac PDVS (orientation not 
known) , two copies of the tac promoter from PL 
Biochemicals Pharmacia-LKB; catalog no. 27-4883) , the T7 
gene 10 promoter and start codon modified to consist of a 

20 Nco l site, a Hin dlll site, the trpA terminator (PL no* 
27-4884-01), an M13 origin of replication, and the amsR 
gene of pBR322. Expression of the cloned gene is induced 
by 0.1 mM IPTG. 

Three of twelve ampicillin resistant colonies arising 

25 from the cloning proved to contain the desired fragment, 
based on their size by toothpick assay (Barnes, Science 
195:393, 1977), their ability to give rise to the 1800 bp 
target fragment by colony PGR, and high levels of 
IPTG- induced DNA polymerase activity in an extract created 

30 by heating washed cells from 0.5 ml of culture at 80 

(fraction I, as described below for an early step in the 
purification method) . The first of these plasmids was 
designated pWB253 and used for the preparative production 
of ATaq. 
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P'T^if i" ?t-^°" °t i^arae A r"""nfcs of Mutant Tag 

one liter of late log phase culture of pWB253 in 
coii host strain X7029 (wild-type £. coli having a 
deletion X74 covering the las operon) was distributed 
among four liters of fresh rich culture medium containing 
0.1 nM IPTG, and incubation with shaking was continued at 
37 -c for 12 hours. The total 5 liters was collected by 
centrifugation and resuspended in Lysis Buffer (20 mJ5 

Tris-HCl pH 8.55, 10 mM MgCl2, 16 m (NHJ2SO4, 0.1% NP40, 

0.1% Tween20, and 1 m EDTA) . To 300 ml of cell 
suspension were added 60 mg lysozyme and the cells were 
incubated at 5-10 'C with occasional swirling for 15 
minutes. The cell suspension was then heated rapidly to 
80 'C by swirling it in a boiling water bath, and the cells 
15 maintained at 80-81 -C for 17 minutes. After this 
treatment, which is expected to inactivate most enzymes, 
the cells were cooled to 37 "C in an ice bath, and 2 ml of 
protease inhibitor (100 n^I PMSF in isopropanol) were 
added. The cells were distributed into centrifuge bottles 
and centrifuged 15 minutes at 15,000 in a Sorval SS-34 
rotor at 2*C. The supernatant was designated fraction I. 

Detergents NP40 and Tween20 were present at 0.01% to 
0.5% (usually 0.1%) at all times and in all buffers and 
solutions to which the enzyme was exposed. Unless 
25 otherwise noted all buffers also contained Tris-HCl and 
DTT as described for the storage buffer below. 

After rendering fraction I 0.25 M in NaCl, ten 
percent Polymin-P (polyethylene-imine) was added dropwise 
to precipitate nucleic acids. To determine that adequate 
30 Polymin-P had been added, and to avoid addition of more 
than the minimum amount necessary, 0.5 ml of centrifuged 
extract was periodically tested by adding a drop of 
Polymin-P, and only if more precipitate formed was more 
Polymin-P added to the bulk extract. Centrifugation of 
35 the extract then removed most of the nucleic acids. 

Chromatography with Bio-Rex 70 (used by Joyce and 
Grindley, Proc. Natl. Acad. Sci. USA 80:1830, 1983) was 



20 
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* • 

unsuccessful. The polymerase activity did not bind at 
all, even when the enzyme was diluted to a salt 
concentration of 0.1 g. The reasons for this lack of 
binding to Bio-Rex 70 were not investigated further at 
5 this time. Rather, the flow-through from Bio-Rex 70 was 
applied to another chromatography medium. 

Successful chromatography was then carried out with 
heparin agarose. The extract, by now diluted to 1 liter, 
was stirred with 50 ml of heparin agarose, and then the 

10 agarose packed lightly into a column. The column was 
washed with 0.1 11 NaCl, and the enzyme eluted with 1 M 
NaCl. The peak of polymerase activity (12 ml) was then 
dialyzed against 50% glycerol storage buffer (50% glycerol 
(v/v) , 100 mU KCl, 20 n4! Tris-HCl pH 8.55, 0.1 n^J EDTA, 1 

15 nOI DTT, 0.5% Tween and 20, 0.5% NP40) . The final yield of 
enzyme was 6 ml at a concentration of 300,000 vinits per ml 
(see below) . An aliquot of enzyme was diluted 10-fold 
into storage buffer, and this working strength enzyme was 
designated KT5* 

20 One unit of enzyme is defined as the amount of enzyme 

that incorporates 10 nmoles of deoxytriphosphates into 
acid insolxible material in 30 minutes at 74 *c. Actual 
assay times were 5 minutes or 10 minutes (with appropriate 
extrapolation to 30 minutes) . Titred full-length Tag DNA 

25 polymerase (AmpliTag; commercially available at 5 
commercial units/ul; one commercial unit is believed to be 
ec[uivalent to one of the units defined in this 
application) was used as a standard. The assay buffer was 
20 m|S Tris-HCl pH 7.8, 8 mU MgClg, 0.1 mg/ml BSA, 5 n^S DTT, 

30 4% glycerol, 100 uJJ each dATP, dTTP, and dCTP, 25 ull 
[jHJdTTP (400 cmp/pmole) , and 160 ug/ml activated calf 
thymus DNA (commercially available; Pharmacia) • 

Sequencing Procedure 

Dideoxy sequencing with the aboveATaq is stuomarized 
35 below. It follows basically the procedxire described by 
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Innis et al., Proc. Natl. Acad. Sci. USA 85:9436, 1988. 
The reactions were performed in microtitre wells. 

In the labelling extension reaction, 24 ixl of Lg mix 
(14 Ml of water, 3 nl of 10 X ATaq buffer (20 mU Tris HCl 
5 pH 8.5 at 25'C, 10 m MgClj, 2mM MnClj, lOn^J isocitrate, 
and 16 mJS ammonium sulphate (the ammonium sulphate may be 
replaced with 50 mM KCl or with water) , 3 Ml 10 mM dTTP, 
1 Ml 10 mM dGTP, and 3 Ml 10 mM dCTP) was added to 3 Ml of 
template (0.5 - 1.0 picomole) and 2 Ml (2 picomole) 

10 primer. These solutions were vortexed, spun down, and 
allowed to anneal by heating to 70 'C and cooling to 45 'C. 
^^P dATP (400 mCi/Mmole; 1 mCi/ml is equivalent to 2.5 fiM) 
was dried down and resuspended in the DNA solution and 1 
Ml A Tag (5 units) added. The solution was warmed to 37 'C 

15 for 45 seconds and chilled on ice. Four reaction aliquots 
were taken from this reaction mixture and placed into 
microtitre wells containing 4 Ml of solution containing 
2m1 4 X dXTP and 2m1 of one of four 4 X dd stock 
solutions. 4 X dXTP consists of 120 (M of all 4 dNTP's, 

20 0.2% Tween 20, and 0.2% Nonidet P-40. Each of the 4 X dd 
stock solution contains either 720 mM ddA, 360 mM ddC, 72 
MM ddG, or 360 MM ddT (or water as a control) . The 4 X 
dXTP and the 4 X dd solutions were premixed at a 1:1 ratio 
so that 4 Ml of "tlie resulting solution could be added to 

25 each of the 4 DNA reaction aliquots. The solutions were 
mixed, the microtitre wells covered with tape and warmed 
to 70'-75'C for ten minutes. (Inciibation may be continued 
for twenty or thirty minutes if desired.) The microtitre 
wells were then dried under vacuum (after removal of the 

30 tape) and 12 m1 of blue formamide buffer added. The wells 
were then heated for thirty seconds to 90 'C and 1/5 of the 
material loaded on a gel. 

The Figure is one example of the results of such a 
sequencing reaction. In the Figure the results obtained 

35 with AmpliTaq (wild-type Taq) DNA polymerase are compared 
with ATaq and Sequenase* T7 DNA polymerase. ATaq has an 
insignificant level of 5 • -exonuclease activity since it 
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gives rise to few or no triplet bands on the sequencing 
gel compared to AmpliTaq DNA polymerase. 

The sequencing procedure eUaove was followed 
identically for all experiments except for the differences 
5 in enzyme, enzyme units added, and incubation times noted 
on the Figure. The incxabation time for the experimental 
results shown in lanes A - D was 3 minutes, in lanes E and 
F it was 10 minutes, and in lane G it was 20 minutes. 
Seguenase^ DNA polymerase was used at lower temperatures 

10 and under the conditions described by Tabor and 
Richardson, Proc. Nat. Acad. Sci. USA 84: 4767, 1987. The 
template was single- stranded DNA encoding an artificial 
gene for scorpion toxin AalT . The primer was the 
•reverse* lac primer which spans the start codon of lac Z 

15 on the vector pBs- (Bluescribe 'minus', from Stratagene) • 

From the Figure it is clear that 5 commercial units 
(approximately 30 units, as defined above) of AmpliTaq DNA 
polymerase in a short extension reaction (3 minutes) gives 
very poor sequencing data; whereas 30 iinits or even 150 

20 units of A Tag gives excellent data, even after a long (10 
or 20 minute) extension reaction, and compares favorably 
with Sequenase^ DNA polymerase. 

Deposit 

Strain pWB253/X7029 was deposited with the American 
25 Type Cultiire Collection, Maryland, on October 4, 1990 and 
assigned the number ATCC 68431. Applicant acknowledges 
its responsibility to replace this culture should it die 
before the end of the term of a patent issued hereon, 5 
years after the last request for a culture, or 30 years, 
30 whichever is the longer, and its responsibility to notify 
the depository of the issuance of such a patent, at which 
time the deposits will be made available to the public. 
Until that time the deposits will be made available to the 
Commissioner of Patents iinder the terms of 37 C.F.R. 
35 Section 1-14 and 35 U.S.C. Section 112. 

Other embodiments are within the following claims. 
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r^mpi^ fgT Submission of P WA and Amino Acid Sequences 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Barnes, Wayne M. 

(ii) TITLE OF INVENTION: THERMOSTABLE DNA POLYMERASE 
5 (iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Lyon & Lyon 

(B) STREET: 611 West Sixth Street, Suite 3400 

(C) CITY: Los Angeles 
10 (D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 90017 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5*' Diskette, 1-44 Mb storage 
15 (B) COMPUTER: IBM PS/2 Model 50Z or 55SX 

(C) OPERATING SYSTEM: IBM P.C. DOS 

(Version 3.30) 

(D) SOFTWARE: WordPerfect (Version 5.0) 

(vi) CURRENT APPLICATION DATA: 

20 (A) APPLICATION NUMBER: 07/594,637 

(B) FILING DATE: 05-OCT-1990 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

Prior applications total, 
25 including application 

described below: none 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION: 

30 (A) NAME: Warburg, Richard J. 

(B) REGISTRATION NUMBER: 32,327 

(C) REFERENCE/ DOCKET: 193/240 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (213) 489-1600 

35 (B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 
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TOTAL NUMBER OF SEQUENCES TO BE LISTED: 4 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 

5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(E) NAME: SEQ. ID. NO.: 1 

(ii) SEQUENCE DESCRIPTION FOR SEQUENCE ID NUMBER: 1 
10 AACGGTTTCC CTCTAGAAAT AATTTTGTTT AACTTTAAGA 

AGGAGATATA TCCATGGACG 60 
ATCTGAAGCT CTCCTGGGAC 80 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 160 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(E) NAME: SEQ. ID NO.: 2 

20 (ii) SEQUENCE DESCRIPTION FOR SEQUENCE ID NUMBER: 2 

GAGGTCATGG AGG66GTGTA TCCCCTGGCC GTGCCCCTGG 
AGGTGGAGGT GGG6ATA6GG 60 
GAG6ACTGGC TCTCCGCCAA GGAGTGAAGC TTATCGATGA 
TAAGCTGTCA AACATGAGAA 120 

25 TTA6CCCGCC TAATGAGCGG GCTTTTTTTT AATTCTTGAA 160 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(E) NAME: SEQ. ID NO. 3 

(ii) SEQUENCE DESCRIPTION FOR SEQUENCE ID NUMBER: 3 
GTGTCCATGG ACGATCTGAA GCTCTCC 27 

35 (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(E) NAME: SEQ. ID NO.: 4 

(ii) SEQUENCE DESCRIPTION FOR SEQUENCE ID NUMBER: 4 
GCGAAGCTTC ACTCCTTGGC GGAGAGCCAG TCC 33 
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Claims 

1. A vector comprising nucleic acid encoding a DNA 
polymerase having an amino acid sequencing consisting 
essentially of the amino acid sequence of the Taq DNA 

5 polymerase of Thermus aouaticus lacking the N-terminal 235 
amino acids of Taq DNA polymerase. 

2. The vector of claim 1, said nucleic acid being 
identical to that present in the plasmid pWB253 present in 
the host cell deposited as ATCC No. 68431. 

10 3. A host cell comprising a vector, comprising 

nucleic acid encoding a DNA polymerase having an eunino 
acid sequence consisting essentially of the amino acid 
sequence of the Taq DNA polymerase of Thermus aquaticus 
lacking the N-terminal 235 amino acids of Taq DNA 

15 polymerase. 

4. The host cell of claim 3 deposited as ATCC 

No. 68431. 

5. A purified DNA polymerase having an amino acid 
sequence consisting essentially of the eunino acid sequence 

20 of the Taq DNA polymerase of Thermus acfuaticus lacking the 
N-terminal 235 eonino acids of Taq DNA polymerase. 

6. The purified DNA polymerase of claim 5, said 
polymerase being provided as a homogeneous preparation. 
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